167 research outputs found

    SeqWare Query Engine: storing and searching sequence data in the cloud

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Since the introduction of next-generation DNA sequencers the rapid increase in sequencer throughput, and associated drop in costs, has resulted in more than a dozen human genomes being resequenced over the last few years. These efforts are merely a prelude for a future in which genome resequencing will be commonplace for both biomedical research and clinical applications. The dramatic increase in sequencer output strains all facets of computational infrastructure, especially databases and query interfaces. The advent of cloud computing, and a variety of powerful tools designed to process petascale datasets, provide a compelling solution to these ever increasing demands.</p> <p>Results</p> <p>In this work, we present the SeqWare Query Engine which has been created using modern cloud computing technologies and designed to support databasing information from thousands of genomes. Our backend implementation was built using the highly scalable, NoSQL HBase database from the Hadoop project. We also created a web-based frontend that provides both a programmatic and interactive query interface and integrates with widely used genome browsers and tools. Using the query engine, users can load and query variants (SNVs, indels, translocations, etc) with a rich level of annotations including coverage and functional consequences. As a proof of concept we loaded several whole genome datasets including the U87MG cell line. We also used a glioblastoma multiforme tumor/normal pair to both profile performance and provide an example of using the Hadoop MapReduce framework within the query engine. This software is open source and freely available from the SeqWare project (<url>http://seqware.sourceforge.net</url>).</p> <p>Conclusions</p> <p>The SeqWare Query Engine provided an easy way to make the U87MG genome accessible to programmers and non-programmers alike. This enabled a faster and more open exploration of results, quicker tuning of parameters for heuristic variant calling filters, and a common data interface to simplify development of analytical tools. The range of data types supported, the ease of querying and integrating with existing tools, and the robust scalability of the underlying cloud-based technologies make SeqWare Query Engine a nature fit for storing and searching ever-growing genome sequence datasets.</p

    SeqWare Query Engine: storing and searching sequence data in the cloud

    Get PDF
    Abstract Background Since the introduction of next-generation DNA sequencers the rapid increase in sequencer throughput, and associated drop in costs, has resulted in more than a dozen human genomes being resequenced over the last few years. These efforts are merely a prelude for a future in which genome resequencing will be commonplace for both biomedical research and clinical applications. The dramatic increase in sequencer output strains all facets of computational infrastructure, especially databases and query interfaces. The advent of cloud computing, and a variety of powerful tools designed to process petascale datasets, provide a compelling solution to these ever increasing demands. Results In this work, we present the SeqWare Query Engine which has been created using modern cloud computing technologies and designed to support databasing information from thousands of genomes. Our backend implementation was built using the highly scalable, NoSQL HBase database from the Hadoop project. We also created a web-based frontend that provides both a programmatic and interactive query interface and integrates with widely used genome browsers and tools. Using the query engine, users can load and query variants (SNVs, indels, translocations, etc) with a rich level of annotations including coverage and functional consequences. As a proof of concept we loaded several whole genome datasets including the U87MG cell line. We also used a glioblastoma multiforme tumor/normal pair to both profile performance and provide an example of using the Hadoop MapReduce framework within the query engine. This software is open source and freely available from the SeqWare project (http://seqware.sourceforge.net). Conclusions The SeqWare Query Engine provided an easy way to make the U87MG genome accessible to programmers and non-programmers alike. This enabled a faster and more open exploration of results, quicker tuning of parameters for heuristic variant calling filters, and a common data interface to simplify development of analytical tools. The range of data types supported, the ease of querying and integrating with existing tools, and the robust scalability of the underlying cloud-based technologies make SeqWare Query Engine a nature fit for storing and searching ever-growing genome sequence datasets

    Elevated levels of diesel range organic compounds in groundwater near Marcellus gas operations are derived from surface activities

    Get PDF
    Author Posting. © The Author(s), 2015. This is the author's version of the work. It is posted here by permission of National Academy of Sciences for personal use, not for redistribution. The definitive version was published in Proceedings of the National Academy of Sciences of the United States of American 112 (2015): 13184-13189, doi: 10.1073/pnas.1511474112 .Hundreds of organic chemicals are utilized during natural gas extraction via high volume hydraulic fracturing (HVHF). However, it is unclear if these chemicals, injected into deep shale horizons, reach shallow groundwater aquifers and impact local water quality, either from deep underground injection sites or from the surface or shallow subsurface. Here, we report detectable levels of organic compounds in shallow groundwater samples from private residential wells overlying the Marcellus Shale in northeastern Pennsylvania. Analyses of purgeable and extractable organic compounds from 64 groundwater samples revealed trace levels of volatile organic compounds, well below the Environmental Protection Agency’s maximum contaminant levels, and low levels of both gasoline range (GRO; 0-8 ppb) and diesel range organic compounds (DRO; 0-157 ppb). A compound-specific analysis revealed the presence of bis(2-ethylhexyl)phthalate, which is a disclosed HVHF additive, that was notably absent in a representative geogenic water sample and field blanks. Pairing these analyses with 1) inorganic chemical fingerprinting of deep saline groundwater, 2) characteristic noble gas isotopes, and 3) spatial relationships between active shale gas extraction wells and wells with disclosed environmental health and safety (EHS) violations, we differentiate between a chemical signature associated with naturally occurring saline groundwater and a one associated with alternative anthropogenic routes from the surface (e.g., accidental spills or leaks). The data support a transport mechanism of DRO to groundwater via accidental release of fracturing fluid chemicals derived from the surface rather than subsurface flow of these fluids from the underlying shale formation.The authors thank Duke University’s Pratt School of Engineering and the National Science Foundation’s CBET Grant Number 1336702 and NSF EAGER (EAR-1249255) for financial support.2016-04-1

    Impacts of coastal infrastructure on shoreline response to major hurricanes in southwest Louisiana

    Get PDF
    © The Author(s), 2022. This article is distributed under the terms of the Creative Commons Attribution License. The definitive version was published in Cadigan, J., Bekkaye, J., Jafari, N., Zhu, L., Booth, A., Chen, Q., Raubenheimer, B., Harris, B., O’Connor, C., Lane, R., Kemp, G., Day, J., Day, J., & Ulloa, H. Impacts of coastal infrastructure on shoreline response to major hurricanes in southwest Louisiana. Frontiers in Built Environment, 8, (2022): 885215. https://doi.org/10.3389/fbuil.2022.885215.The Rockefeller Wildlife Refuge, located along the Chenier Plain in Southwest Louisiana, was the location of the sequential landfall of two major hurricanes in the 2020 hurricane season. To protect the rapidly retreating coastline along the Refuge, a system of breakwaters was constructed, which was partially completed by the 2020 hurricane season. Multi-institutional, multi-disciplinary rapid response deployments of wave gauges, piezometers, geotechnical measurements, vegetation sampling, and drone surveys were conducted before and after Hurricanes Laura and Delta along two transects in the Refuge; one protected by a breakwater system and one which was the natural, unprotected shoreline. Geomorphological changes were similar on both transects after Hurricane Laura, while after Delta there was higher inland sediment deposition on the natural shoreline. Floodwaters drained from the transect with breakwater protection more slowly than the natural shoreline, though topography profiles are similar, indicating a potential dampening or complex hydrodynamic interactions between the sediment—wetland—breakwater system. In addition, observations of a fluidized mud deposit in Rollover Bayou in the Refuge are presented and discussed in context of the maintenance of wetland elevation and stability in the sediment starved Chenier Plain.Funding for the study has been partially provided by the National Science Foundation through grants NSF 2139882, 2139883, 1829136, 1848650, and 1939275, as well as through the United States Army Corps of Engineers Regional Sediment Management program. Student support provided through the National Science Foundation Graduate Research Fellowship Program and the Louisiana Coastal Science Assistantship Program

    Sensory Electrical Stimulation Improves Foot Placement during Targeted Stepping Post-Stroke

    Get PDF
    Proper foot placement is vital for maintaining balance during walking, requiring the integration of multiple sensory signals with motor commands. Disruption of brain structures post-stroke likely alters the processing of sensory information by motor centers, interfering with precision control of foot placement and walking function for stroke survivors. In this study, we examined whether somatosensory stimulation, which improves functional movements of the paretic hand, could be used to improve foot placement of the paretic limb. Foot placement was evaluated before, during, and after application of somatosensory electrical stimulation to the paretic foot during a targeted stepping task. Starting from standing, twelve chronic stroke participants initiated movement with the non-paretic limb and stepped to one of five target locations projected onto the floor with distances normalized to the paretic stride length. Targeting error and lower extremity kinematics were used to assess changes in foot placement and limb control due to somatosensory stimulation. Significant reductions in placement error in the medial–lateral direction (p = 0.008) were observed during the stimulation and post-stimulation blocks. Seven participants, presenting with a hip circumduction walking pattern, had reductions (p = 0.008) in the magnitude and duration of hip abduction during swing with somatosensory stimulation. Reductions in circumduction correlated with both functional and clinical measures, with larger improvements observed in participants with greater impairment. The results of this study suggest that somatosensory stimulation of the paretic foot applied during movement can improve the precision control of foot placement

    Candidate biomarkers of PARP inhibitor sensitivity in ovarian cancer beyond the BRCA genes

    Get PDF
    BACKGROUND: Olaparib (Lynparza™) is a PARP inhibitor approved for advanced BRCA-mutated (BRCAm) ovarian cancer. PARP inhibitors may benefit patients whose tumours are dysfunctional in DNA repair mechanisms unrelated to BRCA1/2. We report exploratory analyses, including the long-term outcome of candidate biomarkers of sensitivity to olaparib in BRCA wild-type (BRCAwt) tumours. METHODS: Tumour samples from an olaparib maintenance monotherapy trial (Study 19, D0810C00019; NCT00753545) were analysed. Analyses included classification of mutations in genes involved in homologous recombination repair (HRR), BRCA1 promoter methylation status, measurement of BRCA1 protein and Myriad HRD score. RESULTS: Patients with BRCAm tumours gained most benefit from olaparib; a similar treatment benefit was also observed in 21/95 patients whose tumours were BRCAwt but had loss-of-function HRR mutations compared to patients with no detectable HRR mutations (58/95). A higher median Myriad MyChoice® HRD score was observed in BRCAm and BRCAwt tumours with BRCA1 methylation. Patients without BRCAm tumours derived benefit from olaparib treatment vs placebo although to a lesser extent than BRCAm patients.CONCLUSIONS: Ovarian cancer patients with tumours harbouring loss-of-function mutations in HRR genes other than BRCA1/2 may constitute a small, molecularly identifiable and clinically relevant population who derive treatment benefit from olaparib similar to patients with BRCAm

    Multimodal characterization of the late effects of traumatic brain injury: a methodological overview of the Late Effects of Traumatic Brain Injury Project

    Get PDF
    Epidemiological studies suggest that a single moderate-to-severe traumatic brain injury (TBI) is associated with an increased risk of neurodegenerative disease, including Alzheimer’s and Parkinson’s disease (AD and PD). Histopathological studies describe complex neurodegenerative pathologies in individuals exposed to single moderate-to-severe TBI or repetitive mild TBI, including chronic traumatic encephalopathy (CTE). However, the clinicopathological links between TBI and post-traumatic neurodegenerative diseases such as AD, PD, and CTE remain poorly understood. Here we describe the methodology of the Late Effects of TBI (LETBI) study, whose goals are to characterize chronic post-traumatic neuropathology and to identify in vivo biomarkers of post-traumatic neurodegeneration. LETBI participants undergo extensive clinical evaluation using National Institutes of Health TBI Common Data Elements, proteomic and genomic analysis, structural and functional MRI, and prospective consent for brain donation. Selected brain specimens undergo ultra-high resolution ex vivo MRI and histopathological evaluation including whole mount analysis. Co-registration of ex vivo and in vivo MRI data enables identification of ex vivo lesions that were present during life. In vivo signatures of postmortem pathology are then correlated with cognitive and behavioral data to characterize the clinical phenotype(s) associated with pathological brain lesions. We illustrate the study methods and demonstrate proof of concept for this approach by reporting results from the first LETBI participant, who despite the presence of multiple in vivo and ex vivo pathoanatomic lesions had normal cognition and was functionally independent until her mid-80s. The LETBI project represents a multidisciplinary effort to characterize post-traumatic neuropathology and identify in vivo signatures of postmortem pathology in a prospective study

    Climate drives the geography of marine consumption by changing predator communities

    Get PDF
    Este artículo contiene 7 páginas, 3 figuras, 1 tabla.The global distribution of primary production and consumption by humans (fisheries) is well-documented, but we have no map linking the central ecological process of consumption within food webs to temperature and other ecological drivers. Using standardized assays that span 105° of latitude on four continents, we show that rates of bait consumption by generalist predators in shallow marine ecosystems are tightly linked to both temperature and the composition of consumer assemblages. Unexpectedly, rates of consumption peaked at midlatitudes (25 to 35°) in both Northern and Southern Hemispheres across both seagrass and unvegetated sediment habitats. This pattern contrasts with terrestrial systems, where biotic interactions reportedly weaken away from the equator, but it parallels an emerging pattern of a subtropical peak in marine biodiversity. The higher consumption at midlatitudes was closely related to the type of consumers present, which explained rates of consumption better than consumer density, biomass, species diversity, or habitat. Indeed, the apparent effect of temperature on consumption was mostly driven by temperature-associated turnover in consumer community composition. Our findings reinforce the key influence of climate warming on altered species composition and highlight its implications for the functioning of Earth’s ecosystems.We acknowledge funding from the Smithsonian Institution and the Tula Foundation.Peer reviewe

    Gut Flora Metabolism of Phosphatidylcholine Promotes Cardiovascular Disease

    Get PDF
    Metabolomics studies hold promise for the discovery of pathways linked to disease processes. Cardiovascular disease (CVD) represents the leading cause of death and morbidity worldwide. Here we used a metabolomics approach to generate unbiased small-molecule metabolic profiles in plasma that predict risk for CVD. Three metabolites of the dietary lipid phosphatidylcholine—choline, trimethylamine N-oxide (TMAO) and betaine—were identified and then shown to predict risk for CVD in an independent large clinical cohort. Dietary supplementation of mice with choline, TMAO or betaine promoted upregulation of multiple macrophage scavenger receptors linked to atherosclerosis, and supplementation with choline or TMAO promoted atherosclerosis. Studies using germ-free mice confirmed a critical role for dietary choline and gut flora in TMAO production, augmented macrophage cholesterol accumulation and foam cell formation. Suppression of intestinal microflora in atherosclerosis-prone mice inhibited dietary-choline-enhanced atherosclerosis. Genetic variations controlling expression of flavin monooxygenases, an enzymatic source of TMAO, segregated with atherosclerosis in hyperlipidaemic mice. Discovery of a relationship between gut-flora-dependent metabolism of dietary phosphatidylcholine and CVD pathogenesis provides opportunities for the development of new diagnostic tests and therapeutic approaches for atherosclerotic heart disease
    corecore